Faults in Grids: Why are they so bad and What can be done about it?
نویسندگان
چکیده
Computational Grids have the potential to become the main execution platform for high performance and distributed applications. However, such systems are extremely complex and prone to failures. In this paper, we present a survey with the grid community on which several people shared their actual experience regarding fault treatment. The survey reveals that, nowadays, users have to be highly involved in diagnosing failures, that most failures are due to configuration problems (a hint of the area’s immaturity), and that solutions for dealing with failures are mainly application-dependent. Going further, we identify two main reasons for this state of affairs. First, grid components that provide high-level abstractions when working, do expose all gory details when broken. Since there are no appropriate mechanisms to deal with the complexity exposed (configuration, middleware, hardware and software issues), users need to be deeply involved in the diagnosis and correction of failures, when, in fact, all they want is to run their applications. One needs a way to coordinate different support teams working at the grids different levels of abstraction. Second, fault tolerance schemes today implemented on grids tolerate only crash failures. Since grids are prone to more complex failures, such as heisenbugs, one needs to tolerate tougher failures. Our hope is that the very heterogeneity, that makes a grid a complex environment, can help in the creation of diverse software replicas, a strategy that can tolerate more complex failures.
منابع مشابه
Competition in Healthcare: Good, Bad or Ugly?
The role of competition in healthcare is much debated. Despite a wealth of international experience in relation to competition, evidence is mixed and contested and the debate about the potential role for competition is often polarised. This paper considers briefly some of the reasons for this, focusing on what is meant by “competition in healthcare” and why it is more valuable to think about th...
متن کاملWhy we need to read and understand literature: literariness and Hans Rosling’s Factfulness (2018)
My article addresses the qualities of “good” literature and how an understanding of the nature of literary devices, so-called “literariness”, can enhance the reading experience. Focusing on Hans Rosling’s Factfulness (2018), I discuss some of the most important features of good writing. Six literary devices have been selected for special attention: point of view, tone, amplification, anecdotes,...
متن کاملI-17: Recent Developments in Animal Welfare Sciencee
The methodology for the scientific assessment of animal welfare has developed rapidly in recent years and has become a major scientific discipline. The concepts of welfare, need, stress, health, pain, emotion and feeling are now clarified. In teaching, legislation and practical work referring to animal welfare, a clear definition that can be related to other concepts is needed. The welfare of a...
متن کاملWhy Studying Rare Diseases Is So Important: Do You Know of This Disease?
Some diseases affect many people and are of course very bad. Some other diseases affect just a few people and are called rare diseases. This sounds like a good thing but... if you are one of the few people affected by a rare disease, this is not a good thing! It is actually very bad because, often, pharmaceutical companies are not interested in developing a treatment for you because too few peo...
متن کاملAn Empirical Study about Why Dissatisfaction Arises Among the Employees and What It Consequences: Bangladesh Perspective
This article aimed at identifying the rate of dissatisfied employees who had left their previous jobs and the main factors which caused their dissatisfaction. In order to collect data for this study a well-structured questionnaire was distributed to 150 employees of different private and public organizations in Bangladesh who already left their previous jobs and 142 usable responses were rec...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003